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Tsar Reference Guide 



This Reference Guide describes the theory and 
methods behind the following techniques used in 
Tsar: 

• property calculations 

• Tsar statistics 

• data reduction techniques 

• clustering techniques 

• regression techniques. 



Property Calculations 



Overview 



The property calculations in Tsar derive a wide range of structural 
descriptors from the simple 2D and 3D structural information available for 
a structure. In addition, some calculations use atomic partial charges to 
derive further properties. ^ 

<t> Refer to Chapter 8, "Calculating Structure Properties" in the Tsar User 
Guide, for information about using Tsar to calculate structure properties, 

You can also import structure information directly from databases. Tsar is 
supplied with two standard databases of substituent properties taken from 
the published work of Hansch and Leo (aromatic . t sard and 
alipfcacic . tsard). A further database provided by Dr. van de 
Waterbeemd is stored as waterbeemd. taard 

<fr Refer to Chapter 5, "Managing Data" in the Tsar User Guide, for 
Y information about using Tsar to create your own databases and import 
information from databases. 
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Chapter I - Properry Calculations 



Structure property calculations 



The structure property calculations are accessed and defined using 
options from the Properties menu. The calculations use the 2D and 
3D structural information stored in the project to derive further 
properly descriptors, such as: 

• mass, surface area, volume 

• Verloop parameters 

• moments of inertia 

• dipoles 

• molar refractivity . 

• lipophiZicity, Iipoles 

• connectivity, shape, topology, and electrotopology indices 

• autocorrelograms ^ 

• similarity indices (Asp users only) 

• electrostatic parameters (Vamp users only) 

• toxicity calculations (TOPKAT QSTRs for Tsar users only). 

<t> Access online Help for Asp, Vamp, and TOPKAT QSTRs for Tsar 
by choosing Help] Online Reference I Asp/Vamp/TOPKAT 
QSTRs for Tsar from a Project Window. 

The Verloop parameter and molecular volume calculations use 
approximate numerical algorithms, which may produce very slightly 
different results for the same molecule in different orientations. 

The calculations make the following assumptions: 

• molecular mass, moments of inertia/principal ellipsoid, and 
autocorrelograra calculations assume mean atomic masses for 
constinient atoms (parameterized for elements up to atomic 
number 100) 
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Structure property calculations 



Verloop parameters, surface area, and volume calculations 
assume standard Van der Waals radii for each element 

surface area calculations use a probe radius of 1 .4 A. 
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Topological, connectivity, and shape indices 



Baiaban Indices, 



The Balaban index, 7, is the average-distance sum connectivity 
(Baiaban, 1982). 

For a connected molecular graph G, 



where: 

M - number of edges in G. 

jLt = cyclomatic number of G. On a 
poly cyclic graph, u is the minimum 
number of edges that must be 
removed before G becomes acyclic. 



and is as defined for 
(he Weiner index. 



Connectivity indices 5 




Htall and jCier (1992) have developed molecular ponnectivity indices 
(Chi) that reflect the atom identities, bonding environments and 
-numbex of bonding hydrogens. These Kier indices are consequently 
useful in a wider variety of applications (Hall and Kier, 1 992 detail 
xnany examples). 

-''''''^"^lecuiw'n^t'ar^drawn without hydrogen atoms can be 

:' t : V' decomposed into fragments of length m, which may be divided into 

, Afferent categories. Hall and Kiex; defined four series of fragment 
C ^'^egctti^: Path. Cluster. Fath/auste'r, and Ring. The spread :yid 
■ numbers of £agwient membexship for each category is determined v 
by molecule connectivity. 
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Etectrotopologlcal Indices calculated by Tsar 



•* ; x ■ : - - \ k/ v ^> . -You can use- the Topology options in Tsar to calculate the following 
indices: 

1c i -ci, - • y\<: - Sum- of Atoms calculate the sum of the states of all the individual 

-i ' atqms in the molecule or substiruent (see 
diagram). 



First Atom * 

calculate the state o£$e first 9Ubstto«at" 
- atom attached to the generic ipso atom 



' r •- 1 ; \* c I :y Y^t^^ ^ ie ' chD ?^n subst&ient ts-attached 



<^ulotelW>ym>f itt; elates of all the ;' 
fndki&^ator&df the chosen ■ 
suttstftueni (s\ibstitu6m 2) 




Substituent 2 



Diagram oka whole molecule with tm substituents 



Kier and Hall shape indices 



~* The Kappa mdexfkier and-Hall, 1992) is i molecule shape index 
' 1 based on the assumption that the shape of a molecule is a function of 
the number of atoms and their bonding relationship. This reference 
contain* a^ei^Ue^o^u^on of the applications of these indices, 
including a comparison with the Tafc Steric parameter. 

Notice that work was carried out on hydrogen suppressed or 
'skeleton' structures. 

Hall and Kier found tjiat for molecules with A atoms, a set of path 
variables m P could be defined: 

m P- the total number of paths along adjacent bonds of length m bond 
lengths/Tor eacli rnoleCTiler 

They found that molecules with dissimilar bonding patterns had 
different values of m P over the range of m. Also, that for each 
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Topological, connectivity, ttdshapem &ts 



combination of A and m. there was a maximum 
value of m P that, could be expressed iu terms of A. Tbenrfore «cn 
mo lecule i has a shape and a series of relied -P, values. «•<* of 
which has a value between the rwo«xtreroe» of shape. 

'mas- i mf ' 1 
where A is the same for all 3 stnictnres. ' 



Molecule 




, 6 15 Maximum possible value of 
'Pfor 6 atoms 



6 J ^ 



><"/ -.iC'.r 



6 ' , 5 ' Minimum possible value of 
v . ' J Pfor6atoms 



g^l'v Kappa Indices (k) ^ u 

EacbKa^mdexbbasedcraamathemalicalcx^ 

■h. ; . ratios between "P^ m Pi and -P.*. 

i|^^ ^^/f J&PP a * shows the degree of complexity of a bonding pattern. 



For any number of atoms A: 



(ACA-D)/2 



and 



Therefore, if 



1 l P V 

It* - 



TTaen 



, substituting taeequations for ^ and into the equation: 



#2* 



uUttUUMUUHiii 
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Chapter I - Property Calculations 



* A(A-l) 

• : (^) 

^--A for an acyclic structure . 
; ic > A for cyclic structures. 

Kappa 2 or ^ Kappa 2 indicates the degree of linearity or star-likeness of bonding 
part ems. 

In a similar way, Kappa 2 inay be defined as follows, for any 
■>;r!ii.-Hj >-■■:. number of atoms A' 

'" ;t ' ' ' * 

2 K= fA-l)(A^2) 2 - 

Kappa 3 Of 3 k KAPP a 3 indicates the degree of branching at the centre, of a 
molecule, larger for predominantly linear molecules with 
■ branching at the ends. . ; 

' Kappa 3 may be defined as follows, for any number of atoms A: 



VI. ■»*■*>. 



2 

3 K= . (A-1)(A "3) when A is odd 



3 2 
, . ... ( >;) 



3 KZ ^ (A-2) 3 (A^3^ When/Us cyefl 



3 2 
(ft).. 
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The basic Kappa indices were derived assuming that all atoms axe 
equivalent. Hall and Kier aiso defined a group of modified indices, 
the Kappa Alpha indices. 

The contribution of each atom to the overall shape of a molecule is 
based on a comparison >*ith a Carbon sp 3 atom. An alpha value Ls 
calculated for each atom type using the ratio of Covalent radii of 
C(sp 3 ) and the atom. 

For a molecule with nou-CCsp 3 ) atoms, the*Kappa indices are 
modified by replacing A with (A+a), where a is the sum of a* for all 
A. Therefore the Kappa Alpha equations are; 

1 x^= ((A + a)U + a-l) 2 )/(V f +a) 

\= «A + a-l)(A + ot-2) 2 )/( 2 P, + a)* 

3 x a = ( (A + a - 1 ) (A + a - 3)V( 3 P i + a) when A is od d 

3 K^= ((A+a-2) 2 (A + CC-3))/( P ( +Ct) when A is even 

In small molecules (A£4) some of the "VP quantities are not be 
defined, or are set to zero. Consequently, useful Kappa values cannot 
be calculated from the equations above. 
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Chapter I - property Calculations^ 



Flexibility or 4> 



Hall and Kiex extrapolated the expected indices for small mote^Uleisv 
using methods described in the 1992 reference: 



Ik r 4 l'oDO 


forraetbane " 3 k 


= 0.000 


for 








methane 


2 K =0.000 


for methane 


= 1.450 


for ethane 


,= 1,000 


forcthane • 


, = 2.000 


for 








propane 






= 3.378 


for butane 




Hall and Kiex do not discuss values for Kappa Alpha in small 
molecules. In Tsax» the Kappa Alpha values of small molecules are 
set to zex.c^unleas 0=0.0 (i.e., only Cjtsp 3 ) atoms), when they are set 
equal to th^e'Kappa indices. ^ 



Flexibility of a molecule is directly related to the degree of linearity,-, 
and the presence of cycles arioVor branching. The Kappa alpfca 
indices measure these factors while also taking the effects of atomic, 
identities on shape into account; 

Hall and Kiex found that by combining j k* and ^ indices, a further 
index <}>, which measured flexibility,, could be defined: 

For alkalies, $ is equal to the number "of 1 boriSs m tffe carbon si 
Branching and/or cyclisarion lead to a sharp decline in 4>. 




